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Abstract 

Background: Mouse limb bud is a prime model to study the regulatory interactions that control vertebrate 
organogenesis. Major aspects of limb bud development are controlled by feedback loops that define a self-regulatory 
signalling system. The SHH/GREM1/AER-FGF feedback loop forms the core of this signalling system that operates 
between the posterior mesenchymal organiser and the ectodermal signalling centre. The BMP antagonist Gremlin 1 
(GREM1) is a critical node in this system, whose dynamic expression is controlled by BMP, SHH, and FGF signalling and 
key to normal progression of limb bud development. Previous analysis identified a distant c/'s-regulatory landscape 
within the neighbouring Forminl {Fnnnl) locus that is required for Greml expression, reminiscent of the genomic 
landscapes controlling HoxD and Shh expression in limb buds. 

Results: Three highly conserved regions (HMC01-3) were identified within the previously defined critical genomic 
region and tested for their ability to regulate Greml expression in mouse limb buds. Using a combination of BAC and 
conventional transgenic approaches, a 9 kb region located -70 kb downstream of the Greml transcription unit was 
identified. This region, termed Greml Regulatory Sequence 1 {GRS1), is able to recapitulate major aspects of Greml 
expression, as it drives expression of a LocZ reporter into the posterior and, to a lesser extent, in the distal-anterior 
mesenchyme. Crossing the GRS1 transgene into embryos with alterations in the SHH and BMP pathways established 
that GRS1 depends on SHH and is modulated by BMP signalling, i.e. integrates inputs from these pathways. Chromatin 
immunoprecipitation revealed interaction of endogenous GLI3 proteins with the core c/s-regulatory elements in the 
GRS1 region. As GLI3 is a mediator of SHH signal transduction, these results indicated that SHH directly controls Greml 
expression through the GRS1 region. Finally, all c/s-regulatory regions within the Greml genomic landscape locate to 
the DNAse I hypersensitive sites identified in this genomic region by the ENCODE consortium. 

Conclusions: This study establishes that distant c/s-regulatory regions scattered through a larger genomic landscape 
control the highly dynamic expression of Greml, which is key to normal progression of mouse limb bud development. 



Background 

During embryonic development, the spatio-temporal ex- 
pression of transcriptional regulators and morphogenetic 
signals is tightly controlled. In fact, different types of con- 
genital malformations are caused by mutations affecting 
specific c/s-regulatory and non-coding genomic regions 
that alter the expression of genes or gene clusters in 
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specific tissues (reviewed in ref. [1-3]). Genome-wide 
functional mapping revealed many of the large genomic 
landscapes that control the expression of developmental 
regulator genes in a global and tissue-specific manner [4]. 
This study and others reveal that the cis- regulatory 
regions controlling gene expression in a particular tissue 
are often located several hundred kilobases (kb) or even 
megabases (Mb) up- or downstream of the transcriptional 
start sites in other loci, and may control the expression of 
several genes [5-7]. Recently, it has been shown how mul- 
tiple ds-regulatory regions interact to control the 
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expression of the 5'HoxD gene complex in the presump- 
tive digit territory. The transcriptionally active part of 
the 5'HoxD gene cluster forms a so-called regulatory 
archipelago in which dispersed ds-regulatory elements co- 
operate to control gene expression in the distal limb bud 
by interacting with proximal promoters over large dis- 
tances [8]. In fact, regulatory landscapes with distant and 
dispersed ds-regulatory regions seem to be a recurring 
theme in the dynamic spatio-temporal regulation of genes 
that are essential for limb bud development (reviewed in 
ref. [2,9,10], see also below). 

Vertebrate limb bud development is controlled by 
interactions between two main signalling centres, the 
apical ectodermal ridge (AER) and the zone of polarizing 
activity (ZPA) located in the posterior limb mesenchyme. 
These two signalling centres interact as part of a self- 
regulatory feedback signalling system involving several sig- 
nalling pathways [11]. In particular, ZPA cells produce the 
Sonic Hedgehog (SHH) signal, which together with 
GREMLIN1 -mediated antagonism of Bone Morphogenetic 
Proteins (BMPs) in the posterior-distal limb bud mesen- 
chyme propagates Fibroblast Growth Factor (FGF) signal- 
ling in the AER [11-14]. This SHH/GREM1/AER-FGF 
feedback-signalling loop promotes distal progression of 
limb bud outgrowth and formation of the autopod which 
gives rise to carpals and digit rays (reviewed in ref. [15]). 
SHH expression in the posterior limb bud mesenchyme is 
controlled by a ds-regulatory region located about 800 kb 
upstream of the transcriptional start site. Deletion of this 
ZPA regulatory region (ZRS) results in loss of distal limb 
structures [16], while point mutations within the ZRS result 
in anterior ectopic Shh expression and duplications of the 
thumb and/or anterior digits in different species [17,18]. In 
cells actively transcribing Shh, the ZRS loops out of its 
chromosomal territory to the Shh transcription unit, which 
reveals how distant ds-regulatory elements control gene 
expression [19]. Recently, several transcriptional regulators 
have been identified that control Shh expression by directly 
interacting with the distant ZRS region. These include 
HOX proteins, the bHLH transcription factor HAND2, and 
ETS transcription factors, providing a glimpse of the com- 
plex transcriptional regulation of Shh in the posterior limb 
bud mesenchyme [20-22], 

SHH signalling is transduced by the GLI family of 
transcriptional regulators and inhibits the constitutive 
processing of the full-length GLI3 activator to its repres- 
sor form GLI3R (see e.g. ref. [23]). Of the three GLI 
transcription factors expressed during limb bud develop- 
ment, only GU3 is essential on its own (see e.g. ref. [24]). 
The inactivation of GU3 alters morphogenetic feedback 
signalling and results in formation of additional anterior 
digits [23,25-27]. In particular, GLI3 restrains the prolif- 
eration of mesenchymal progenitors in the anterior limb 
bud mesenchyme and promotes initiation of digit ray 



chondrogenesis by directly repressing Greml expression 
during handplate formation [26]. 

We have previously shown that the expression of 
Greml in the limb bud mesenchyme is controlled by a 
large genomic landscape downstream of its transcription 
unit [7]. In fact, several of the classical limb deformity 
mutations that disrupt distal limb bud development and 
formation of metanephric kidneys in the mouse are 
caused by deletions affecting this ds-regulatory land- 
scape rather than directly altering the Greml gene 
products [7,13,28]. Molecular and genetic analysis in 
the mouse identified a 70 kb genomic region located 
downstream of the Greml transcription unit within the 
neighbouring Forminl (Fmnl) gene that is required 
for Greml expression in the limb bud mesenchyme 
[7,29]. Detailed analysis of this Greml genomic land- 
scape revealed similarities with the global control re- 
gion (GCR) that controls the expression of 5'HoxD 
genes in the limb bud mesenchyme [6], but did not 
reveal the structural nature and transacting factors 
and/or signalling pathways controlling these ds-regulatory 
regions. 

To gain further insights into the Greml landscape, the 
potential cis- regulatory activities of the three highest 
evolutionarily conserved genomic regions within the 
70 kb Greml critical region were analysed. In addition 
to its expression in the posterior mesenchyme, emphasis 
was also given to the coordinated distal-anterior expan- 
sion of Greml expression, which is key to orderly pro- 
gression of limb bud development [14,30]. Combining 
BAC with conventional transgenic approaches, we iden- 
tified a 9 kb genomic region that is able to recapitulate 
major but not all aspects of the dynamic spatio-temporal 
Greml expression in the limb bud mesenchyme. This 
9 kb region was termed Greml Regulatory Sequence 1 
(GRS1), and contains a core sequence that is essential to 
express a LacZ transgene under control of a fi-globin 
minimal promoter in a Greml -like pattern. The GRS1 
region drives gene expression into the posterior and 
subsequently distal-anterior mesenchyme, i.e. reproduces 
aspects of the distal- anterior expansion of Greml expres- 
sion during progression of limb bud development. By 
crossing GRS1 transgenic mice into different mutant con- 
texts, we establish that the GRS1 region is controlled by 
inputs from both the SHH and BMP signalling pathways 
in limb buds. In addition, chromatin immunoprecipitation 
(ChIP) shows that GLI3 interacts with the conserved 
HMCOl sequences in the GRS1 region. The functionally 
relevant cis- regulatory regions identified by the present 
and two previous studies [31,32] map to the endogenous 
DNAse I hypersensitive sites within the Greml genomic 
landscape recently identified by the ENCODE consortium 
[33,34] . This indicates that the Greml cis -regulatory land- 
scape is composed of at least five active cis- regulatory 
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regions that control the spatio-temporal expression of 
Greml in mouse embryos and limb buds. 

Results and Discussion 

Identification of conserved limb bud regulatory regions 
within the Greml genomic landscape 

Using functional genetics in the mouse, we previously 
identified a -70 kb region located downstream of Greml 
that is required for its expression in the limb bud mesen- 
chyme (Figure 1A, [7]). This limb bud cis -regulatory re- 
gion is located between coding exons 19 and 23 of the 
neighbouring Fmnl gene. The Evolutionary Conserved 
Regions (ECR) genome browser (http://ecrbrowser.dcode. 
org/) was used for multiple sequence alignments to com- 
pare the mouse genome with different mammalian and 
the chicken genomes. This analysis revealed several blocks 
of highly conserved sequences among mammalian species, 
but only three of them were also highly conserved in the 
chicken and termed HMCOl, 2 and 3 (Human-Mouse- 
Chicken-Opossum conserved sequences 1 to 3, 
Figure 1A). The most conserved parts of these three 
HMCO core regions are -80% identical (for details see 
Additional file 1 and Additional file 2). Of these three 
regions only HMCOl is also present in the orthologous 
genomic region in frogs (Figure 1A), which express Greml 
during limb bud development [35]. Despite the fact that 
Greml is expressed during fin bud development [36], no 
HMCO homologies are present in the zebrafish genome 
(Figure 1A). ClustalW2 alignments (http://www.ebi.ac.uk/ 
Tools/msa/clustalw2/) of the Fmnl-Greml regions from 
different species revealed the absence of HMCO homolo- 
gies in other ray-finned fish species such as fugu and me- 
daka (Figure IB). In contrast, all three HMCO regions are 
present in the genome of coelacanth, a lobe-finned fish 
closely related to tetrapods (Figure IB and Additional file 
1) [37]. As the Greml and Fmnl loci are linked in all spe- 
cies analysed (Figure IB), the distance between the ortho- 
logous Fmnl coding exon 22 (located adjacent to 
HMCOl) and Greml coding exon 2 was determined. Dur- 
ing evolution, this distance increased as it is smallest in 
the genomes of ray finned fishes and at least -10-fold lar- 
ger in tetrapods and coelacanth (Figure IB). In addition, 
the distance between Fmnl exon 19 (close to HMC03) 
and exon 22 correlates well with the presence or absence 
of the three HMCO regions (Figure IB and data not 
shown). In ray-finned fishes and frogs (lacking all or spe- 
cifically HMC02, 3, respectively), the distance is -3-7 kb, 
while in mammals and coelacanth it ranges from -50- 
70 kb. Thus, the increase of intronic and intergenic 
regions in the Fmnl-Greml landscape correlates well with 
appearance of the three HMCO regions during vertebrate 
evolution. 

To determine the requirement of each of these three 
HMCO regions for Greml expression in mouse limb 



buds, we used a BAC-based strategy in combination with 
analysis of transgenic founder embryos (Figure 2A-D 
and Additional file 3). A 250 kb mouse genomic BAC 
encoding the critical region and the Greml transcription 
unit was used to fuse the LacZ gene in frame with the 
Greml ORF [7] . We assessed the expression of the LacZ 
reporter transgene by analysing the spatio-temporal 
distribution of E-galactosidase activity (Figure 2A-D). 
Expression of the Greml -LacZ fusion protein in the 
forelimb bud mesenchyme was detected in a posterior- 
distal domain (Figure 2A), mimicking the early endogen- 
ous Greml expression rather accurately. Therefore, the 
Greml-LacZ BAC construct was used to engineer dele- 
tions of the three HMCO regions and determine their 
requirement for Greml-LacZ expression in the posterior 
mesenchyme. To control for reproducible generation of 
expressing BAC transgenic founder embryos, the 
Greml-LacZ BAC (Figure 2 A) was injected in parallel to 
the BACs with engineered deletions (Figure 2B-D). 
While the control Greml-LacZ BAC was always robustly 
expressed in the posterior mesenchyme (Figure 2A), de- 
letion of the 520 bp HMCOl core region resulted in 
complete loss of LacZ expression from limb buds 
(Figure 2B, n = 5/7 embryos with LacZ expression, for 
details see Additional file 3). In contrast, deletion of the 
1279 bp HMC02 core region only caused partial loss of 
LacZ expression from the posterior limb bud mesen- 
chyme (Figure 2C; Additional file 3). LacZ remained 
expressed normally in the majority of founder embryos 
carrying a 924 bp deletion of the HMC03 region 
(Figure 2D, n = 3/5 embryos with LacZ expression, for 
details see Additional file 3). Taken together, this BAC 
transgenic analysis establishes HMCOl as most critical 
for Greml expression in the posterior-distal limb bud 
mesenchyme. As the HMCOl core region is highly con- 
served in tetrapods and lobe-finned fish but not ray- 
finned fish (Figure IB), it likely represents a ds-regula- 
tory region important to tetrapod evolution. The other 
two HMCO regions might contribute to robust expres- 
sion of Greml-LacZ in the posterior mesenchyme, as in 
particular the deletion of HMC02 results in significantly 
reduced LacZ expression (Figure 2C). 

In an attempt to gain further insight into the enhancer 
potential and possible interactions of HMC02/3 with 
HMCOl, conventional transgenic approaches using a 
minimal human fi-globin promoter (figlob-LacZ) were 
employed. However, neither individual HMCO regions 
(data not shown) nor in a combination of all three was 
able to drive robust expression of the figlob-LacZ trans - 
gene in the posterior limb bud mesenchyme (Figure 3A, 
compare to Figure 2A). Expression of the figlob-LacZ 
transgene under control of all three HMCO core regions 
resulted in scattered LacZ positive cells in the anterior- 
distal mesenchyme of forelimb buds (left panel, 
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(See figure on previous page.) 

Figure 1 Identification of three highly conserved non-coding regions in tetrapods and a lobe-finned fish. (A) Sequence alignment of the 

genomic region critical for Greml expression in mouse limb buds [7] using the ECR browser with the mouse genome release 9 (mm9) as 

reference genome. The critical genomic region on mouse chromosome 2 is shown in centromeric (CEN) to telomeric (TEL) orientation and is 

located downstream of the Greml coding exons. As the critical genomic region is part of the Fmnl locus, the interspersed Fmnl coding exons 

are indicated as open boxes in the scheme. Three blocks of highly conserved non-coding sequences were identified (HMC01-3) and are 

indicated in blue. Conserved coding regions are indicated in black and non-coding regions conserved >74% over > 1 00 bp are coloured salmon. 

The peak detected in the region of HMC01 in the zebrafish genome corresponds to Fmnl exon 22. Regions consisting of repetitive sequences 

are shown in green (see Additional file 2). (B) Conserved linkage between the Greml and Fmnl loci in vertebrates. Increased intergenic distances 

correlate with the presence of HMCO regions in tetrapods and coelacanth in contrast to ray-finned fishes. The phylogenetic tree analysis was 

done with the UCSC multiple alignment functions to align the 3' part of the Fmnl locus and the Greml locus from different species. Open boxes 

represent the orthologous Fmnl coding exons 19 and 22, black box represents Greml coding exon 2. The intergenic distances between Fmnl 

orthologous exon 22 and Greml coding exon 2 are indicated to the right of the scheme. ENSEMBL genomes used for alignment: mouse: M. 

musculus (mm10); human: H. sapiens (hg 1 9); chimpanzee: P. troglodytes (panTro3); dog: C. familiaris (canFam2); bovine: B. taurus (bosTau6); 

opossum: M. domestico (monDom5); chicken: G. gallus (galGal3); lizard: A corolinensis (anoCar2); frog: X. tropicalis (xenTro2); coelacanth: L 

cholumnoe (LatChal); fugu: T. rubripes (fr3); medaka: 0. lotipes (oryLat2); zebrafish: D. rerio (danRer7). 
I J 



Figure 3A, n = 3/3). In contrast, the transgene was strongly 
expressed in the posterior embryo including hindlimb 
buds (right panel, Figure 3A). These results show that a 
transgene consisting of an array of the HMCO core 
regions is unable to drive LacZ expression into the poster- 
ior forelimb bud mesenchyme. This indicates that add- 
itional elements of the Greml genomic landscape are 
required for Greml expression in limb buds. 

Next, we assessed to which extent a larger genomic 
fragment containing HMCOl and flanking regions could 
drive figlob-LacZ expression into the posterior limb bud 
mesenchyme (Figure 3B). Analysis of transgenic founder 
embryos revealed that this transgenic construct drives 
robust figlob-LacZ expression in the posterior limb bud 
mesenchyme (Figure 3B, see also Figure 4), strikingly 
similar to the domain of the parental Greml -LacZ BAC 
construct (Figure 2A). Taken together, these results indi- 
cate that this ds-regulatory region enhances Greml 
expression in the posterior limb bud mesenchyme, 
which prompted us to term this 9 kb region Greml 
Regulatory Sequence 1 (GRS1, Figure 4A). GRS1 encom- 
passes the HMCOl region next to Fmnl coding exon 22 
and a further downstream region that is highly con- 
served in mammals. This region, like HMCOl overlaps 
with a previously identified GLI binding region (GBR, 
[32]). In addition, two potential GLI binding sites were 
identified by in silico analysis (Figure 4A). 

Spatio-temporal activity of the GRS1 region during mouse 
limb bud development 

To analyze comparatively the activity of the GRS1 region 
with respect to the spatio-temporal regulation of Greml 
expression, transgenic mouse strains expressing the 
GRS1 -figlob-LacZ reporter construct were established 
(Figure 4). Seven transgenic founders were obtained. In 
three independent strains, LacZ was expressed in the 
posterior fore- and hindlimb bud mesenchyme in a pat- 
tern comparable to the transgenic founder embryos 



(Figure 3B and data not shown). Two of these three 
transgenic strains were analyzed in detail and used to 
study the spatio-temporal LacZ distribution (Figure 4B, 
C and Additional file 4). Initial analysis revealed that 13- 
galactosidase activity of the LacZ transgene was detected 
earlier in one strain and is significantly higher than in 
the other strain (Figure 4B, compare to Figure 4C). As 
levels did not change during subsequent generations 
(data not shown), this is likely due to differences in 
transgene copy number and/or integration site. There- 
fore, the copy number of both strains was determined 
by real-time qPCR analysis (Additional file 4). This ana- 
lysis revealed that ten copies of the GRSl-figlob-LacZ 
transgene were integrated into the genome of the strain 
expressing higher LacZ levels (GRSl tgl °), while only two 
copies were detected in the GRSl tg2 strain (Additional 
file 4). In GRSl tgl0/+ embryos, ft-galactosidase activity is 
first detected at~E10.25 in the posterior forelimb bud 
mesenchyme and continuously increases until ~ El 1.75 
(Figure 4B). From-Ell.O onwards, scattered positive 
cells were detected in the anterior mesenchyme, and this 
anterior expression increased during distal limb bud out- 
growth to form a crescent in the distal-anterior autopod 
(right panels, Figure 4B). By El 1.75, ft-galactosidase 
activity was rather variable, such that the anterior cres- 
cent separated from the posterior domain in some fore- 
limb buds. In GRSl tg2/+ embryos, the spatio-temporal 
pattern is similar, but expression levels are significantly 
reduced due to the lower transgene copy number, result- 
ing in detection of E-galactosidase activity from only- 
E10.5 onwards (Figure 4C, arrowhead). In both strains, 
GRS1 drove expression of the figlob-LacZ reporter spe- 
cifically in the limb bud mesenchyme, despite some low 
ft-galactosidase activity detected in the developing eyes 
of GRSl tgl0/+ transgenic embryos (Additional file 4). This 
analysis reveals the robust nature of the GRS1 region, 
which functions in positive regulation of Greml expres- 
sion in the limb bud mesenchyme. The posterior domain 
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Figure 2 A BAC-based strategy to assess the requirement of highly conserved non-coding regions (HMCOI-3) for Greml expression in 
mouse limb buds. (A) A 250 kb BAC spanning the genomic region from Fmnl exon 19 to 150 kb up-stream of the Greml transcription is able 
to drive expression of a LocZ transgene (blue arrow; inserted in-frame into the Greml ORF) into the posterior limb bud mesenchyme in 
transgenic founder embryos. (B) Deletion of the HMC01 core region (520 bp) abolishes the LocZ expression. (C, D) Deletion of the core HMC02 
region (1279 bp) results in reduced LocZ expression, while deletion of HMC03 (924 bp) does not alter the LocZ distribution. B-galactosidase 
activity colours expressing cells blue. 



of ft-galactosidase activity at E10.5 (Figure 4B) is compar- 
able to the Greml transcript distribution in wild-type limb 
buds at earlier stages (E9.5-E 10.25, Figure 4D). This tem- 
poral delay is likely due to postponed transcriptional activa- 
tion and/or up-regulation of the GRS1 transgene as the 
establishment of a posterior LacZ expression domain is also 
only apparent at E10.5 in GRSl tgl0/+ transgenic limb buds 
(Additional file 5). Therefore, additional ds-regulatory 
regions likely control the temporally correct early onset of 
Greml expression. Similar delays in the onset of LacZ 
reporter gene expression have been previously observed by 



analyzing the ZRS ds-regulatory region that controls Shh 
expression in mouse limb buds [38]. During distal pro- 
gression of limb bud development, endogenous Greml 
expression expands distal-anterior within the developing 
handplate and begins to fade by El 1.75 (Figure 4D), due to 
termination of the SHH/GREM1/AER-FGF feedback loop 
and GLI3-mediated repression in the anterior limb bud 
mesenchyme [11,26,39,40]. In contrast, E-galactosidase ac- 
tivity remains high in the posterior limb bud mesenchyme 
of GRS1 transgenic embryos and the distal-anterior expan- 
sion of its expression is significantly delayed and only 
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Figure 3 A 9 kb GRS1 transgene encompassing HMC01 drives (Iglob-LacZ expression into the posterior limb bud mesenchyme. (A) A 

transgene encoding the three HMCO core regions downstream of the Bglob-LacZ minimal promoter and reporter results in aberrant B- 
galactosidase activity in the anterior-distal limb bud mesenchyme. An asterisk marks the posterior border of the forelimb bud. A black arrow 
marks the anterior border of LocZ expression in the trunk. A white arrowhead points to the hindlimb bud. (B) The GRS1 -Bglob-LacZ transgene is 
expressed in the posterior limb bud mesenchyme of transgenic founder embryos. The 9 kb GRS1 region was inserted downstream of the Bglob- 
LacZ reporter to keep the same arrangement as in the endogenous Greml locus (Figure 2A). 



occurs as transcription of the endogenous Greml locus is 
starting to terminate (El 1.5 onwards Figure 4B, C, compare 
to Figure 4D). 

Greml expression is restricted to dorsal and ventral 
limb bud mesenchyme and excluded from the chondro- 
genie core mesenchyme (Figure 5A), which is relevant to 
BMP-mediated induction of chondrogenesis (ref. [26,41] 
and Benazet et al, submitted). Furthermore, the LIM- 
homeodomain transcription factors Lhx2 and Lhx9 have 
been implicated in regulating Greml expression pre- 
dominantly in the ventral limb bud mesenchyme in re- 
sponse to SHH signalling [42]. Therefore, the extent to 
which the dorso-ventral transcript distribution is main- 
tained by the GRS1 transgene was assessed (Figure 5B). 
Indeed, the expression of both the high (Figure 5B) and 
low copy transgenes (Additional file 6) remained 
excluded from the core mesenchyme throughout limb 
bud outgrowth and patterning. Similar to Greml tran- 
scripts, E-galactosidase activity is higher dorsally than 
ventrally (Figure 5B, compare to Figure 5A). By El 1.75, 
E-galactosidase activity expands anteriorly in both the 
dorsal and ventral mesenchyme (El 1.75, Figure 5B), 
reminiscent of the crescent of Greml transcript 
(Figure 5A and see before), which indicates that the 
GRS1 contains ds-regulatory regions controlled by Lhx 
transcription factors [42]. Taken together, the results 
shown in Figures 4 and 5 establish that the 9 kb GRS1 
cis- regulatory region is able to recapitulate major aspects 
of Greml expression in the limb bud mesenchyme. In 
particular, the GRS1 also recapitulates aspects of the 
distal- anterior expansion of Greml expression in limb 
buds (Figure 4, 5), which was not observed in previous 
attempts to identify ds-regulatory regions that control 
Greml expression in limb buds [7,32], The fact that the 
GRS1 transgene recapitulates some aspects of this distal- 
anterior expansion is important, as Greml transcription 



normally expands anteriorly in register with posterior 
AER-FGFs and allows propagation of SHH/GREM1/AER- 
FGF signalling [14,30]. However, the delay in activation 
and lack of termination of the GRS1 transgene (Figure 4) 
indicates that other regulatory inputs from the Greml 
genomic landscape are required to regulate its dynamic 
expression in mouse limb buds. 

The GRS1 c/s-regulatory region integrates inputs from the 
SHH and BMP signalling pathways 

During limb bud initiation, mesenchymal BMP4 signal 
transduction is likely required to activate Greml expres- 
sion in the posterior mesenchyme, while SHH is primar- 
ily required for up-regulation and distal-anterior 
expansion during progression of limb bud development 
[11,14,43]. Furthermore, GLI3 in the anterior mesen- 
chyme and AER-FGF signal transduction are required to 
restrict and eventually terminate Greml expression 
(starting ~ El 1.5-1 1.75) [26,40]. Therefore, GRSl tgl0/+ 
embryos lacking key components of both the SHH and 
BMP signalling pathway in their limb buds were gener- 
ated to gain insight into the possible direct impact of 
these main signalling pathways on the GRS1 element 
(Figure 6). Analysis of Shh A/A GRSl tgl0/+ embryos 
revealed the complete absence of E-galactosidase activity 
in Shh -deficient forelimb buds (Figure 6A). This con- 
trasts with the endogenous Greml expression, which is 
activated but not maintained in <S7z/z-deficient limb buds 
[11,14]. These results show that activation of the GRS1 
region depends on SHH signalling, which indicated that 
it could also participate in ectopic Greml activation due 
to anterior ectopic SHH signalling in mouse [14] and 
chicken limb buds [12]. Furthermore, GLI3 proteins 
interact with specific cis- regulatory regions in the Greml 
genomic landscape [32] and genetic analysis has shown 
that GU3 is essential for the spatio-temporally controlled 
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restriction and subsequent termination of Greml expres- 
sion in the distal anterior mesenchyme [25,26,44]. In 
G//3-deficient GRSl transgenic (Gli3 xt/xt GRSl tgl0/+ ) 
forelimb buds, E-galactosidase activity in the posterior 
limb buds is comparable to GRSl tgl0/+ limb buds, while 
expression in the anterior mesenchyme is significantly 
increased by El 1.75 (Figure 6B, compare to right-most 
panels, Figure 4B). This late up-regulation of anterior 13- 
galactosidase activity indicates that GRSl is required for 
GLI3-mediated termination of Greml expression in the 
anterior limb bud mesenchyme, which is essential for 
the spatio-temporally correct initiation of mesenchymal 



condensations and chondrogenic differentiation of anter- 
ior digits [26]. 

BMPs control Greml activation in the posterior limb 
bud mesenchyme and directly modulate its expression as 
part of the self-regulatory SHH/GREM1/AER-FGF feed- 
back signalling system. Therefore, a conditional Smad4 
loss-of- function allele [45] in combination with the Prxl- 
Cre recombinase strain [46] was used to inactivate Smad4 
(Smad4 A/AM ) and thereby canonical BMP signal transduc- 
tion in the limb bud mesenchyme. In Smad4 A/AM mutant 
limb buds, endogenous Greml expression is down- 
regulated but not lost, while Shh expression remains 
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(Benazet et al, submitted). In Smad4 A/AM GRSl tgl0/+ fore- 
limb buds, ft-galactosidase activity appears normal in the 
posterior mesenchyme, while distal-anterior expansion 
fails to occur (Figure 6C, compare to Figure 4B). This 
indicates that GRS1 activity is modulated by SMAD4- 
mediated signal transduction during the progression of 
limb bud development These results indicate that also the 
distal-anterior expansion of Greml expression depends on 
BMP activity and agree with the observation that genetic 
lowering of Bmp4 results in a global reduction of Greml 
expression in the limb bud mesenchyme [11]. 

Finally, in Greml -deficient embryos, BMP activity is 
increased due to reduced BMP antagonism, which results 
in up-regulation of non-functional Greml transcript [13]. 
As BMP signal transduction modulates GRS1 activity 
during advanced limb bud development (Figure 6C), 
we also determined the potential influence of Greml 
deficiency on GRS1 activity. In Greml A/A GRSl tgl0/+ 
E10.5 limb buds E-galactosidase activity is initially similar 
to GRSl tgl0/+ limb buds (Figure 6D, compare to 
Figure 4B). Subsequently, E-galactosidase activity is 
increased and by E11.75 the entire Greml A/A GRSl tgl0/+ 
limb bud is positive (Figure 6D). These alterations of 
E-galactosidase activity in Smad4 A/AM and Greml A/A 
forelimb buds reveal that GRS1 activity is extensively 
modulated by changes in BMP activity, in particular 
also the anterior expansion of its expression. Taken 
together, this genetic analysis (Figure 6) shows that 
GRS1 activity critically depends on SHH and is 
modulated extensively by BMP signal transduction. 
This analysis identifies the GRS1, located -70 kb 
downstream of the Greml transcription unit as a cis- 
regulatory region that integrates inputs by both SHH 
and BMP signal transduction. However, as previous 
analysis provided good evidence that Greml is acti- 
vated by mesenchymal BMP signalling upstream of 
establishing the SHH/FGF feedback loop [11], these 
studies point to the existence of additional unknown 
BMP response regions that activate Greml expression 
in the posterior limb bud mesenchyme. 

Endogenous GLI3 proteins are part of the chromatin 
complexes interacting with specific parts of the GRS1 
region in mouse limb buds 

The observed loss of GRS1 activity in S7z/z-deficient limb 
buds (Figure 6A) indicated that the GRS1 region could 
be regulated directly by GLI proteins. Indeed, two GLI 
binding regions without consensus GLI binding motif 
(GBR1 and GBR2, [32]) were previously mapped to the 
9 kb GRS1 region using transgene-mediated expression 
of an epitope-tagged GLI3 transgene in combination 
with chromatin immunoprecipitation (ChIP). To study 
the potential interaction of endogenous GLI3 proteins 
with the GRS1 region in its normal genomic context in 



wild-type limb buds, ChIP analyses using antibodies that 
detect both the GLI3 activator and repressor protein iso- 
forms were performed [26,47]. GLI3 ChIP using extracts 
of wild-type and G//3-deficient limb buds at El 1.5 
revealed significant enrichment of one specific region 
within GRS1 by real-time qPCR analysis (Figure 7A). A 
significant ~4-fold enrichment of an amplicon located 
within HMCOl was observed by comparing wild- type to 
G//3-deficient limb buds (amplicon "d"; p = 0.008) and to 
a less conserved region (amplicon "e"; located outside 
HMCOl close to a putative GLI binding site; 
Figure 7A). In contrast, no significant enrichments of 
the other amplicons located in GRS1 were detected 
(Figure 7A). As the amplicons located in HMC02 
(Figure 7B) and HMC03 (Figure 7C) were also not sig- 
nificantly enriched, the GLI3-containing chromatin com- 
plexes appear to interact rather specifically with a region 
within the highly conserved HMCOl in wild- type limb 
buds. These results not only corroborates the identity of 
previously identified GBR1 [32], but indicate that this 
region mediates the effects of SHH signal transduction 
on GRS1 activity in the posterior limb bud mesenchyme. 

Spatio-temporal Greml expression is regulated by the 
interaction of multiple c/s-regulatory regions far 
downstream of the transcription unit 

Previous genetic analysis showed that Greml transcription 
in limb buds is initiated by BMP signalling and up- 
regulated under the influence of SHH and AER-FGF sig- 
naling as limb bud development progresses [11]. Finally, 
Greml expression is terminated concomitantly with the 
initiation of digit formation by high levels of FGF signal 
transduction and GLI3-mediated repression in the anter- 
ior mouse limb bud mesenchyme [26,40]. This dynamic 
spatio-temporal regulation of Greml expression indicates 
that the activity of different cis -regulatory elements may 
change over time. The spatio-temporal activity of the 
GRS1 transgene (Figure 4B) indicates that it is primarily 
regulated by the SHH/GREM1/AER-FGF feedback signal- 
ing system as limb bud development progresses. In par- 
ticular, its activity is first apparent when the SHH/ 
GREM1 /AER-FGF feedback is already established, and the 
expected termination does not occur, as E-galactosidase 
activity remains in the posterior mesenchyme after the en- 
dogenous Greml transcripts have been down-regulated 
(Figure 4B, D). These results indicate that FGF-mediated 
termination of Greml expression and the underlying 
self-regulatory feedback signalling system [40] does 
not occur by FGF signal transduction impacting the 
GRS1 region. In addition, neither the 70 kb critical region 
of the Greml landscape {Greml-LacZ, Figure 2A) nor 
transgenes derived from this region (Figures. 2, 3, 4 and 
ref. [32]) are able to accurately recapitulate the entire 
spatio-temporal distribution of Greml transcript in the 
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limb bud mesenchyme. Therefore, either the interactions 
among these elements or additional as yet unknown cis- 
regulatory regions located outside the critical region must 
also participate in the regulation of Greml expression. In- 
deed, there is circumstantial evidence for the latter, as 
deletion of the GC-rich Fmnl coding exon 9 (located 
-200 kb downstream of the Greml transcription unit) 
results in non-complementation with a Greml null allele, 
causing a hypomorphic Greml limb skeletal phenotype, 
and significantly reduced Greml expression (ref. [29] and 
A.Z and R.Z., unpublished data). 

Mapping of DNAse I hypersensitive (HS) sites is used 
to identify active regulatory regions in chromatin 
(reviewed in ref. [48]). Recently, the ENCODE consor- 
tium has done genome-wide mapping of HS sites in 
several mouse tissues including limb buds (ENCODE 
Group, University of Washington) [33,34]. Interestingly, 
this analysis revealed three HS sites within the GBRs of 
the GRS1 and HMC02, while no HS sites mapping to 
HMC03 were detected in limb buds at El 1.5 (Figure 8). 
The five HS sites mapped by the ENCODE consortium 
in the Greml genomic landscape overlap with the cis- 
regulatory elements required for Greml expression in 



mouse limb buds (Figure 8). The HS site mapping to 
HMCOl overlaps well with the amplicon enriched by 
GLI3-ChIP analysis (Figure 7A). In fact, all HS sites in 
the Greml genomic landscape overlap either with the 
GBR regions interacting with GLI3 [26,32] or CTCF 
binding sites [31]. Analysis of Ctcf- deficient limb buds 
revealed its requirement for up-regulation and distal- 
anterior expansion of Greml expression [31]. Interest- 
ingly, the HS site overlapping with GBR4 [32] is not con- 
served between mammals and birds (Figure 8 and data 
not shown). While the early GBR4 activity is comparable 
to GRS1 in the posterior mesenchyme (Figure 4B), it does 
not support distal-anterior expansion of the expression 
during subsequent development [32]. Thus, the inter- 
action of the GBR4 and GRS1 ds-regulatory regions could 
provide Greml expression with the necessary robustness 
and evolutionary plasticity, as has been postulated for the 
transcriptional regulation of HoxD genes by multiple 
interacting regulatory regions during limb bud develop- 
ment [8]. Taken together, these studies establish that the 
five HS sites in the Greml genomic landscape are func- 
tionally relevant for Greml expression in mouse limb 
buds, but not all of them are evolutionary highly 
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conserved. Indeed, experimental and comparative evolu- 
tionary evidence indicates that alterations in the spatio- 
temporal expression of Greml, and thereby the activity of 
the SHH/GREM1/AER-FGF feedback loop, likely contri- 
butes significantly to variations in digit numbers and 
morphologies (reviewed in ref. [49]). 

Conclusions 

In this study, we identify GRS1 as a distant ds-regulatory 
region that encompasses the highly conserved HMCOl 
region and show that it controls important aspects of 
Greml expression in the posterior and distal- anterior limb 
bud mesenchyme. GRS1 activity depends critically on 
SHH signalling and endogenous GLI3 protein complexes 
interact with specific regions within HMCOl in wild- type 
limb buds. In addition, the anterior expansion of GRS1 
activity is also regulated by BMP signal transduction. 
These results together with previous studies reveal the 
large genomic architecture composed of several distant 
ds-regulatory regions that control the highly dynamic 
Greml expression in response to signalling inputs from 
the BMP, SHH and FGF pathways. It is likely that the 
interactions among these dispersed ds-regulatory regions 
provide the dynamic Greml expression in limb buds with 
the necessary robustness, but at the same time allow for 
evolutionary plasticity of its expression [8]. 

Methods 

Ethics Statement 

All the experiments were conducted in strict accordance 
with Swiss law following the 3R principles and the con- 
duct defined by the Basel Declaration. All studies involv- 
ing mice were classified as grade zero by the Animal 



Welfare and Ethics Commission of both cantons of 
Basel and Argovia, which implies minimal suffering. 

Mouse strains 

The Shh loss-of- function allele [50], the GU3 extra-toes-] 
allele [51], the Greml null allele [13] and the GRSl tg/+ 
alleles (this study) were maintained in an NMRI back- 
ground, while the Smad4 conditional allele (Smad^ 10 *, 
[45]) and the Prxl-Cre transgene [46] were maintained 
in a C57BL/6 J background. 

BAC modifications 

BAC constructs were engineered using ET cloning as 
previously described [7]. The deletions AHMCOl, 
AHMC02 and AHMC03 were induced in the Greml- 
LacZ BAC clone using a Kanamycin resistance selection 
cassette. All primer sequences are available upon request. 
See Additional file 8 for the genomic coordinates of 
HMCOl-3. 

Rglob-LacZ transgene constructs 

An expression cassette was constructed in pKS-Bluescript 
using LacZeocin reporter under control of the minimal 
human ^-globin promoter sequences [52]. Various combi- 
nations of HMCO sequences were inserted downstream 
into this cassette. The HMCOl, -2 and -3 core sequences 
were amplified by PCR directly from mouse genomic 
DNA (primers available on request). The 9 kb GRS1 
region was initially subcloned as a 9.6 kb Bgl2 fragment 
from BAC RP23-113 H17 (chr2: 113,611,499-113,847,299 
(mmlO). BacPac Resources, Children's Hospital, Oakland, 
USA). All constructs were linearized with Kspl before 
microinjection. 



Zuniga et al. BMC Developmental Biology 2012, 12:23 
http://www.biomedcentral.com/1471-21 3X/1 2/23 



Page 13 of 15 



Generation of transgenic founder embryos and transgenic 
strains 

BAC and figlob-LacZ transgenic constructs were injected 
into the pronucleus of fertilized mouse eggs. Several 
founder embryos for each construct were scored for 
E-galactosidase activity (Additional file 3). For the 
GRSl-figlob-LacZ transgene, seven independent trans- 
genic mouse strains were established by crossing 
founders into the NMRI background. Transgene copy 
numbers were determined by real time qPCR [53] 
using the Bio-Rad CFX96 Real-Time PCR System in 
combination with the iQ SYBR Green Supermix 
(Bio-Rad). For each mouse, 20 ng of genomic DNA 
were analysed in triplicate. The primers to amplify 
the DBH genomic region were used for normalization 
and non-transgenic littermate DNA was used as control. 
The normalized control levels were set to 2 as the primers 
also amplified both alleles of the endogenous HMCOl 
regions. The 2" AACt formula was used to calculate and 
normalize transgene copy numbers. In addition to three 
different Fl males, three embryos from the F2 and three 
embryos from the F3 generation were analysed for the 
two transgenic strains shown to ascertain stable transmis- 
sion of the transgenes. Primers sequences are available on 
request. 

Detection of R-galactosidase activity 

Embryos were isolated in ice-cold PBS and staged 
according to somites numbers, then fixed in 1% formal- 
dehyde, 0.2% glutaraldehyde, 0.02% NP40, 0.01% sodium 
deoxycholate in lx PBS for 20-30 minutes at 4°C. Sub- 
sequently, embryos were washed three times in lx PBS 
for 5 minutes at room temperature and incubated over- 
night at 37°C and in the dark in 1 mg/mL X-Gal, 
0.25 mM K3Fe(CN6), 0.25 mM K4Fe(CN6), 0.01% NP- 
40, 0.4 mM MgCl2 and 1% sodium deoxycholate to de- 
tect 13-galactosidase activity, which colours cells blue. To 
stop the reaction, embryos were washed three times in 
lx PBS for 5 minutes each at room temperature. 

Chromatin Immunoprecipitation (ChIP) 

Forelimbs and hindlimbs of 10 wild-type or Gli3 xt/Xt 
embryos at El 1.5 were dissected and processed for ChIP 
as described [26] using a polyclonal anti-GLI3 antibody 
(#2676, Genentech, [47]). To compute the level of en- 
richment of a given region, the Ct values of both input 
and ChIP samples were compared with those of a nega- 
tive control amplicon located in the mouse fi-actin locus 
[21]. All results (mean±SD) were obtained by analysing 
three completely independent experiments per genotype. 
The significance of all differences was assessed using the 
two-tailed, non-parametric Mann-Whitney test. The 
coordinates of the relevant amplification are shown in 
Additional file 7. 



All genomic sequence alignements of the Fmnl-Greml 
locus region were performed using the ECR browser 
[54] and the ClustalW2 program [55]. 
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Additional file 1: Figure SI. Conservation of the HMCO core regions. 
ClustalW2 multiple sequence alignment of the HMC01, HMC02 and 
HMC03 core regions of mouse (mm 10), human (hg 1 9), chimpanzee 
(panTro3), dog (canFam2), bovine (bosTau6), opossum (monDom5), 
chicken (galGal3), lizard (anoCar2), frog (xenTro2), and coelacanth 
(LatChal) genomes The corresponding genomic coordinates are 
indicated. HMC01: 73 of 149 nucleotides were conserved in all species; 
HMC02: 182/298 conserved nucleotides, HMC03: 45/137 conserved 
nucleotides. 

Additional file 2: Table SI. Genomic coordinates for the sequence 
comparisons shown in Figure 1A. 

Additional file 3: Table S2. Analysis of transgenic founder embryo. 

Additional file 4: Figure S2. Limb bud mesenchymal expression of 
GRSl-Bglob-LacZ transgene in two independent transgenic mouse strains. 
(A) Expression of the GRSl-Bglob-LacZ transgene is restricted to limb 
buds, with expression initiating earlier in forelimb than hindlimb buds. 
Note the differences in B-galactosidase activity in GRSl tgW/+ (left panel) 
and GRS1 tg2/+ (right panel) transgenic mouse embryos. (B) Using real-time 
qPCR, the transgene copy numbers in both the GRSf gl0/+ and GRSf g2/+ 
mouse strains were determined in comparison to wild-type mice 
(carrying 2 copies of the endogenous GRS1 regions). This analysis 
revealed that the GRS1 tgW/+ strain carries 10 copies and GRSl tg2/+ 2 
copies of the transgene, respectively. 

Additional file 5: Figure S3. Comparison of the LacZ mRNA and S- 
galactosidase reporter activity in early limb buds of GRS1 tgl0/+ embryos. 
Distribution of B-galactosidase activity (A) and LacZ transcripts (B) in 
forelimb buds of GRSf gl0/+ embryos at E10.0 and E10.5. Arrowheads 
point to the posterior expression domains. 

Additional file 6: Figure S4. Dorso-ventral distribution of S- 
galactosidase activity in forelimb buds expressing the GR5f g2/+ 
transgene. The GRS1 tg2/+ transgene respects the dorsal and ventral 
restriction of the mesenchymal expression domains. Note that overall 
expression is significantly lower than in GRSl tgl0/+ limb buds (compare to 
Figure 5B). D: dorsal, V: ventral. 

Additional file 7: Table S3. qPCR amplicons for GLI3 ChIP analysis. 

Additional file 8: Table S4. Genomic coordinates of identified Greml 
regulatory regions. 
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